A Time-Warping Pitch Tracking Algorithm Considering Fast f0 Changes
نویسندگان
چکیده
Accurately tracking the fundamental frequency (f0) or pitch in speech data is of great interest in numerous contexts. All currently available pitch tracking algorithms perform a short-term analysis of a speech signal to extract the f0 under the assumption that the pitch does not change within a single analysis frame, a simplification that introduces errors when the f0 changes rather quickly over time. This paper proposes a new algorithm that warps the time axis of an analysis frame to counteract intraframe f0 changes and thus to improve the total tracking results. The algorithm was evaluated on a set of 4718 sentences from 20 speakers (10 male, 10 female) and with added white and babble noise. It was comparative in performance to the state-of-the-art algorithms RAPT and PRAAT to Pitch (ac) under clean conditions and outperformed both of them under noisy conditions.
منابع مشابه
High Resolution Speech F0 Modification
The present paper proposes a new algorithm for pitch modification which is convenient for changing the fundamental frequency of speech with so fine resolution that is at least comparable with human pitch perception. Using the proposed method, measurements of just noticeable changes on speech prosody becomes possible. High resolution F0 manipulation is completed without explicit over-sampling of...
متن کاملMultiple Fundamental Frequency Extraction for Mirex
This extended abstract outlines an efficient approach for the extraction of multiple fundamental frequencies (F0) from polyphonic musical audio. The algorithm consists of three analysis steps. At first a multi-resolution spectral analysis is performed on the audio signal. Then, the most salient pitches are identified using a pitch extraction algorithm, which is designed to identify the predomin...
متن کاملOptimal Current Meter Placement for Accurate Fault Location Purpose using Dynamic Time Warping
This paper presents a fault location technique for transmission lines with minimum current measurement. This algorithm investigates proper current ratios for fault location problem based on thevenin theory in faulty power networks and calculation of short circuit currents in each branch. These current ratios are extracted regarding lowest sensitivity on thevenin impedance variations of the netw...
متن کاملGeneration of Fundamental Frequency Contours of Mandarin in HMM-based Speech Synthesis using Generation Process Model
The HMM-based speech synthesis system can produce high quality synthetic speech with flexible modeling of spectral and prosodic parameters. In this approach, short term spectra, fundamental frequency (F0) and duration are generated by multi-stream HMMs separately. However the quality of synthetic speech degrades when feature vectors used in training are noisy. Among all noisy features, pitch tr...
متن کاملMulti methods pitch tracking
The elaboration of rather large spontaneous speech corpora frequently implies the collection of data recorded with poor acoustic quality which may affect its acoustic analysis, and particularly fundamental frequency tracking (F0). Indeed, F0 analysis is particularly sensitive to distortion due to low signal to noise ratio, filtering of low frequencies, encoding in compressed formats (mp3, wma, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017